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REMARKS 



This Preliminary Amendment is requested prior to the initial examination of the 
above-identified patent application to address minor matters of form and syntax. 

If the Examiner has any comments or suggestions, which could place this 
application in even better form, the Examiner is requested to telephone the undersigned 
attorney at the below-listed number. 

Respectfully submitted, 



Date: Ob /Jyl -2**=* 



Rader, Fishman & Grauer PLLC 

Suite 501 

1233 20th Street, N.W. 
Washington, D.C. 20036 
Telephone: (202) 955-3750 
Facsimile: (202) 955-3751 

Customer No. 23353 

Should additional fees be necessary in connection with the filing of this paper, or if a petition for 
extension of time is required for timely acceptance of same, the Commissioner is hereby 
authorized to charge Deposit Account No. 180013 for any such fees; and applicant(s) hereby 
petition for any needed extension of time. 




David K. Benson 
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A ppendix I 

^'Amendment to the Specification 



Beginning^rpage 3, line 8: 
It is therefore an object of the present invention to provide a speech transformation 
method and apparatus whereby the results of speech recognition can be presented in a 
noise-reduced state depending on the bodily state, the using state or the objective of using 
of the hearing-impaired persons. 

Beginning at page 3, line 12: 

It is another object of the present invention to provide a speech transformation 
method and apparatus whereby a person with voice-speech disorders ascribable to 
laryngectomy, tongue/mouth-floor resection or articulation disorder etc. is able to utter with 
spontaneous speech proper to the user or subject to spontaneous transformation and 
whereby the external speech is outputted to the user to enable spontaneous conversation. 

Beginning at page 6, line 3: 

The present invention is applied to an HA (hearing aid) 1 configured as shown for 
example in Figs. 1 and 2. This HA 1 is of a portable type and includes an HMD (head- 
mounted display) 2, and a computer unit 3 for speech recognition and for generation of the 
speech information eta, interconnected over an optical fiber cable 4, as shown in Fig. 1. 
The computer unit 3 is annexed to a support 5 worn on e.g., a user's waist and is driven by 
power supplied from a battery 6 annexed to the support 5, while driving the HMD 2. 

Beginning at page 6, line 15: 

The display unit 7, arranged just ahead of the user's eye, demonstrates the 
meaning or the contents of the speech detected by the microphone for a user 8 and/ or by 
the microphone for outside 1 1 which will be explained subsequently. It is also possible 
for the display unit 7 to demonstrate not only the meaning and the contents of the speech 
as described above but also the other information responsive to commands from the 
computer unit 3. 
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Beginning at page 7, line 7: 

The microphone for a user 8 and the microphone for outside 1 1 may be designed 
as processor for acoustic signals, such as acoustic echo-canceller, a microphone array or 
a variety of microphones, irrespective of the mounting positions depending on the user's 
operation . These microphones may be exemplified by these types (pressure-type, 
pressure-gradient-type, parametric-type, laser Doppler-type, bone-conduction-type, 
ultra-small two-way unit picking up air-conduction and bone-conduction sound 
(manufactured by NTT), non-directional-type, uni-directional (ultra-directional)-type, 
bi-directional-type, dynamic-type, condenser (electret)-type, zoom-type, stereo-type, MS 
stereo-type, wireless-type, ceramic-type and magnetic-type). 

Beginning at page 8, line 21: 

The computer unit 3 starts the program stored in the recording medium, based on 
the electrical signals generated from the speech detected by the microphone for a user 8 
and/or the microphone for outside 1 1 and performs speech recognition processing in the 
CPU to produce the results of recognition. This permits the computer unit 3 to obtain the 
contents of the speech detected by the microphone for a user 8 and/or the microphone for 
outside 1 1 in the CPU . 

Beginning at page 14, line 3: 

The speech information generating unit 23 works on, transforms or synthesizes 
etc. the results of recognition from the signal processor 22, depending on the bodily state, 
the using state or the using objectives of the user. The speech information generating unit 
23 also performs the processing of presenting the speech detected by the microphone 21 
to the user on the results of recognition or the results of recognition obtained on working 
etc. 

Paragraph beginning at page 14, line 9: 

It is also possible for the speech information generating unit 23 to modify the 
speech information generated from the results of recognition to generate the new speech 
information. At this time, the speech information generating unit 23 annexes words or 
phrases more familiar to the user, based on the bodily state, the using state or the using 
objectives of the user, to improve further the speech recognition by the user. For 
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example, if "BIG MAC'YTrademark) ["Big Mac"] is inputted to the microphone 21, the 
speech information generating unit 23 doing this processing generates e.g., the speech 
information indicating "MACDONALD'S BIG MAC" ["MacDonald's Big Mac] 
(Trademark). 

Beginning at page 20, line 3: 

The speech enhancement unit 25, adapted for outputting the speech to the user's 
ear, may be of the dynamic speech enhancement transducing system or electro-static 
speech enhancement transducing system (capacitor or electrostatic type), and of the 
shape of a headphone [such as the in-the-air type (open-air, closed or canal type)] such as 
the open-air, closed, or in-the-ear type such as canal type . On the other hand, the speech 
enhancement unit 25 may be that used in a conventional HA, speech enhancement unit or 
sound collector speaker, may be of the type employing a magnetic loop or may be of a 
microphone speaker system employing fingers (prototype: [Whisper] WHISPER (NTT 
Docomo)). After all, the speech enhancement unit 25 outputting the speech from the user 
to the speaker may be any suitable conventionally used speech enhancement unit or 
speaker. 

Beginning at page 29, line 5: 

When synthesizing speech data, the speech information generating unit 23 is able 
to execute speech synthesis by rule irrespective of the speech contents , speech synthesis 
employing variable length unit to synthesize the smooth speech, prosody control to 
produce natural speech on synthesis or quantizing conversion for furnishing the speech 
with personality[, irrespective of the speech contents,] in order to generate the speech 
information (see "A perspective in automatic telephone interpretation" Advanced 
Telecommunication Res. Ins. ATR Advanced Tech. Series, Ohmsha, 177-209, 1994). 

Beginning at page 30, line 18: 

It is also possible for the speech information generating unit 23 not only to 
perform the processing of interpreting the language from the speech, using the result of 
recognition, and of constructing the speech information from the speech data using the 
so-interpreted language, but also to perform the processing of working on or 
transforming the language from other processing interpreted on the basis of the results of 
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recognition. That is, it is possible for the present speech information generating unit 23 to 
construct the speech information and to perform the voice speed conversion of changing 
the rate of outputting the speech information [at the speech information generating unit 
23] to the speech enhance unit 25 , such as by elongating the voiced domain by splitting 
or elongating the pitch domain, not working on the invoiced domain, or shortening the 
non-speech domain. That is, this voice speed conversion is performed by selecting 
appropriate voice speed depending on the user's state. 

Beginning at page 34, line 10: 

If the speech information generating unit 23 has heard an alarm bel l a siren of an 
ambulance car or a siren informing of oncoming tsunami, it not only demonstrates that 
effect but outputs from the speech enhancement unit 25 such announcement as "it's fire", 
"ambulance car has arrived" or "tsunami is oncoming" with a large volume, while 
demonstrating an image representing the fire, ambulance car or tsunami on the display 
unit 26. 

Beginning at page 38, line 13: 

It is moreover possible for the signal processor 22 and the speech information 
generating unit 23 to perform speech recognition only for a specified noise, in addition to 
generating the result of recognition only of the speech of the speaker to generate the 
speech information for presentation to the speech enhancement unit 25 and/or the display 
unit 26 and thence to the user. In sum, the signal processor 22 and the speech information 
generating unit 23 performs speech recognition processing on the input sound to 
transform the result of recognition depending on the bodily state, the using state or the 
using objectives of the user, to generate the speech information in expressions readily 
comprehensible for the user to output the generated speech information. 

Beginning at page 47, line 6: 

The HA 1 is not limited to an embodiment in which the speech recognition 
processing is executed by the signal processor 22 using the speech from the microphone 
21 etc. For example, the speech recognition processing may also be executed using 
signals from a nasal sound sensor, respiratory air-stream sensor, neck- vibration sensor, 
bone vibrator, such as mouthpiece-type vibrator, worn by the user or by a person other 
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than the user, or using signals from the microphone 21 . Thus, with the HA 1 , the rate of 
recognition by the signal processor 22 may be improved further by employing respective 
sensors in addition to the microphone 21. 

Beginning at page 47, line 14: 

The HA 1 may also be provided with a camera mechanism 29 for photographing 
a moving picture or a still image by a digital camera carrying e.g., an automatic 
focussing mechanism or a zooming function, to display it on the display unit 26, as 
shown in Fig.2. This camera mechanism 29 may be loaded as one with the display unit 7 
of Fig. 1 . The camera mechanism 29 may use a digital camera. 

Beginning at page 48, line 3: 

The HA 1 demonstrates a photographed image on the display unit 26 from the 
camera mechanism 29 through a signal processing circuit. With the HA 1, the image 
which has photographed the speaker by the camera mechanism 29 is presented to the user 
to improve the user's recognition. It is possible with this HA 1 to output the 
photographed image through the communication circuit 27 to an external network. It is 
also possible with the HA 1 to [demonstrate] input the image photographed by the 
camera mechanism 29 from the external network [by inputting the image] and display the 
image on the display unit 26 through the communication circuit 27 and the signal 
processing circuit etc. 

Beginning at page 50, line 10: 

It is also possible with this HA 1 to photograph a still image by pressing a shutter 
as in a digital camera fort a still image. It is also possible [or] for the camera mechanism 
29 to generate a moving picture to output the so-generated moving picture to the signal 
processor 22. As a signal system for photographing a moving picture by this camera 
mechanism 29, an MPEG (Moving Pictures Experts Group) system may be used. The 
camera mechanism 29 provided in the HA 1 is able to photograph a 3D-image to 
photograph the speaker or the speaker's lip to demonstrate it on the display unit 26 to 
improve the user's recognition further. 



-13- 



Appl. Serial No 



[73,360 



Attr. 




;ket Number: KOI-046 



Beginning at page 51, line 4: 

The HA 1 may also be provided with e.g., a switch mechanism so that the user is 
free to control whether the speech detected by the microphone 21 is outputted to the 
[speaker] speech enhancement unit 25, an image photographed by the camera mechanism 
29 is to be outputted to the display unit 26 or both the speech and the image are to be 
outputted. At this time, the switch mechanism, acted on by the user, controls the output 
from the speech information generating unit 23. 

Beginning at page 53, line 2: 

In the foregoing explanation of the HA 1, an exemplary processing of outputting 
the speech information as the speech is explained. However, this is merely illustrative. 
For example, the HA 1 may be of the type in which the result of recognition is presented 
to the user by the middle ear implant. That is. in this HA 1. the speech information may 
be presented to the user as electrical signals through a coil or a vibrator. 

Beginning at page 53, line 17: 

It is also possible for this HA 1 to modulate, work on or transform the result of 
recognition and the result of recognition worked or transformed into the speech in the 
ultrasonic range, as speech information, to output the resulting information^ The result 
of recognition may be or may not be worked on or transformed,] depending on the bodily 
state, the using state or the using objectives of the user, for example, depending on 
whether or not the hearing-impaired user is able to recognize the speech in the ultrasonic 
range. The HA 1 may also be provided with a bone conduction ultrasound system to 
generate signals in the ultrasonic frequency range to output the generated signal to the 
user through e.g., an ultrasonic vibrator (see "Activation of the auditory cortex by 
ultrasound" Hosoi H., et al., Lancet Feb 14351 (9101), 496-7, 1998). 

Beginning at page 70, line 1 1 : 

The HA 1 can be used as a support for works mainly of hearing-impaired persons 
and persons with speech disorders in office work {as a wearable computer}, 
authentication, voice-speech training, conference, reception by telephone or internet, 
program making, such as animation program, real scenes, news or music program, work 
in the space, transportation (pilot of space ship and airplane), various simulation works, 
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employing VR and AR, operation by remote control (micro-surgery, research 
(marketing), military works, design works, work at home, operations under hostile 
conditions, such as under noise, such as works in construction site or plants, sorting 
works etc. 
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AU6 0 S fflffl ^ A ppendix II 



lendment to the Claims 



10. (Amended) The speech transformation apparatus according to claim 1 
wherein said output means is a [cochlear] middle ear implant mechanism and wherein 

said output control means generates a control signal for outputting the result of 
recognition and/or the result of recognition worked on or transformed as electric signal 

27. (Amended) The speech transformation apparatus according to claim 25 
wherein said acousto-electric transducing means detects the speech uttered by a person 
with voice-speech disorders, using a technique used for correcting the voice-speech 
disorders, other than the speech production substitutes, as the speech uttered with [voce] 
voice -speech disorders, to generate speech signals. 
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